Digging for Names in the Mountains: Combined Person Name Recognition and Reference Resolution for German Alpine Texts

نویسندگان

  • Sarah Ebling
  • Rico Sennrich
  • David Klaper
چکیده

In this paper we introduce a module that combines person name recognition and reference resolution for German. Our data consisted of a corpus of Alpine texts. This text type poses special challenges because of a multitude of toponyms, some of which interfere with person names. Our reference resolution algorithm outputs person entities based on their last names and first names along with their associated features (jobs, addresses, academic titles). DOI: https://doi.org/10.1007/978-3-319-08958-4_16 Posted at the Zurich Open Repository and Archive, University of Zurich ZORA URL: https://doi.org/10.5167/uzh-50451 Accepted Version Originally published at: Ebling, S; Sennrich, R; Klaper, D; Volk, Martin (2011). Digging for names in the mountains: Combined person name recognition and reference resolution for German alpine texts. In: 5th Language Technology Conference, Poznan, Poland, 25 November 2011 27 November 2011. DOI: https://doi.org/10.1007/978-3-319-08958-4_16 Digging for Names in the Mountains: Combined Person Name Recognition and Reference Resolution for German Alpine Texts Sarah Ebling, Rico Sennrich, David Klaper, Martin Volk Institute of Computational Linguistics, University of Zurich Binzmühlestrasse 14, 8050 Zurich, Switzerland {ebling,sennrich,volk}@ifi.uzh.ch, [email protected]

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Challenges in Building a Multilingual Alpine Heritage Corpus

This paper describes our efforts to build a multilingual heritage corpus of alpine texts. Currently we digitize the yearbooks of the Swiss Alpine Club which contain articles in French, German, Italian and Romansch. Articles comprise mountaineering reports from all corners of the earth, but also scientific topics such as topography, geology or glacierology as well as occasional poetry and lyrics...

متن کامل

تشخیص اسامی اشخاص با استفاده از تزریق کلمه‌های نامزد اسم در میدان‌های تصادفی شرطی برای زبان عربی

Named Entity Recognition and Extraction are very important tasks for discovering proper names including persons, locations, date, and time, inside electronic textual resources. Accurate named entity recognition system is an essential utility to resolve fundamental problems in question answering systems, summary extraction, information retrieval and extraction, machine translation, video interpr...

متن کامل

سیستم شناسایی و طبقه بندی اسامی در متون فارسی

Name entity recognition (NER) is a system that can identify one or more kinds of names in a text and classify them into specified categories. These categories can be name of people, organizations, companies, places (country, city, street, etc.), time related to names (date and time), financial values, percentages, etc. Although during the past decade a lot of researches has been done on NER in ...

متن کامل

Classifying Named Entities in an Alpine Heritage Corpus

In the project “Text+Berg" we digitize and archive the heritage of alpine literature from various European countries. In a first step our group digitizes all yearbooks of the Swiss Alpine Club from 1864 until today. The books comprise articles in German, French and Italian, a total of around 100.000 pages. This paper describes the corpus and the project phases towards its digitalization. We the...

متن کامل

A new nomenclature for fungi

Important changes brought about by the Melbourne International Code of Nomenclature for Algae,FungiandPlantsare briefly reviewed concerning a clarification of the spelling and typification of sanctioned fungal names, the recognition of electronic publication for the validity of nomenclatural novelties, permission to use English diagnoses or descriptions for their valid publication, and the requ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2011